Detecting Long-Duration Narrow-Band Gravitational Wave Transients Associated 
with Soft Gamma Repeater Quasi-Periodic Oscillations 
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We have performed an in-depth concept study of a gravitational wave data analysis method which 
targets repeated long quasi-monochromatic transients (triggers) from cosmic sources. The algorithm 
concept can be applied to multi-trigger data sets in which the detector-source orientation and the 
statistical properties of the data stream change with time, and does not require the assumption 
that the data is Gaussian. Reconstructing or limiting the energetics of potential gravitational wave 
emissions associated with quasi-periodic oscillations (QPOs) observed in the X-ray lightcurve tails 
of soft gamma repeater flares might be an interesting endeavour of the future. Therefore we chose 
this in a simplified form to illustrate the flow, capabilities, and performance of the method. We 
investigate performance aspects of a multi-trigger based data analysis approach by using O(lOOs) 
long stretches of mock data in coincidence with the times of observed QPOs, and by using the 
known sky location of the source. We analytically derive the PDF of the background distribution 
and compare to the results obtained by applying the concept to simulated Gaussian noise, as well as 
off-source playground data collected by the 4-km Hanford detector (HI) during LIGO's flfth science 
run (S5). We show that the transient glitch rejection and adaptive differential energy comparison 
methods we apply succeed in rejecting outliers in the S5 background data. Finally, we discuss how 
to extend the method to a network containing multiple detectors, and as an example, tune the 
method to maximize sensitivity to SGR 1806—20 flare times. 

PACS numbers: 95.85Sz, 95.30.Sf, 95.55.Ym 



I. INTRODUCTION 



Soft gamma repeaters (SGRs) are mostly galactic ob- 
jects that emit short bursts of X-ray and gamma radia- 
tion at irregular intervals. At times these sources emit 
exceptionally energetic flares that last hundreds of sec- 
onds, with peak luminosities of up to 10*^ erg s~^ [ij . The 
most luminous SGR flare observed to date was recorded 
on 27 December 2004, and originated from SGR 1806-20 
Quasi-periodic oscillations (QPOs) were observed at 
late times in the lightcurve tail by the Rossi X-Ray Tim- 
ing Explorer (RXTE) and the Ramaty High Energy 
Solar Spectroscopic Imager (RHESSI) satellites, most 
notably at 92.5 Hz and 626.5 Hz [jQ. Although the ex- 
act timing of the QPO signals is unclear since the peak 
flare saturated both detectors, the signals can be unam- 
biguously observed beginning approximately 170 — 200 s 
after the initial flare [l|, Q • 

The ~ 8 s period of the lightcurve tail observed during 
the December 2004 hyperflare 0, as well as the phase- 
dependence of the QPO features, suggests that the parent 
object might be a type of slowly rotating neutron star. 
In this model the phase dependence can be interpreted 
as localization to a specific region of the star's surface. 
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and the observed frequencies may be related to particular 
seismic modes [HQi making QPOs an interesting probe 
of the astroseismology of SGRs. In particular, if QPOs 
are indeed related to seismic vibrations of a neutron star, 
then we can estimate the thickness of the star's crust, and 
possibly infer what type of material makes up the star's 
interior fE\ . It has also been suggested that changes in the 
observed QPOs over time may be indicative of physical 
changes in the source, such as a decaying magnetic field 

i- 

The magnetar model f?\ proposes that SGRs are a type 
of highly magnetized NS with a surface magnetic field in 
excess of 10^^ G. In this model QPOs can be explained 
by a starquake event, in which the star's magnetic field 
becomes twisted and then violently rearranges, exciting 
crustal or global seismic modes and releasinggamma ray 
bursts with the observed QPO structures [IJ^. Numeri- 
cal studies in which the magnetar is treated as a coupled 
system consisting of an elastic crust and a magnetohy- 
drodynamic fluid core have demonstrated a number of 
interesting effects. These include the dynamical gener- 
ation of instabilities and asymmetric quasi-equilibria in 
the magnetic field [lOl, as well as a coupling between 
discrete crustal modes and continuous bands of global 
Alfven modes, with QPOs appearing in the late time star- 
quake oscillations near the edges of the Alfven spectral 
bands plI - fTsj . This numerical work also suggests that 
some QPOs may be exceptionally strong because they are 
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close in frequency to one of the crustal modes [l2l. [isl. [l5| . 
The same dynamics may also drive the emission of gravi- 
tational waves (GWs) by exciting the rnagnetar's seismic 
modes; in particular, crustal f-modes [16|. Although it 
is argued that at realistic values of the magnetic field 
strength f-modes are unlikely to be sufficiently excited to 
produce an observable signal in second-generation grav- 
itational wave detectors [l7l . [T8| , Zink et al. note that 
low-frequency modes up to 100 Hz remain unstudied, and 
may be sufficiently excited to produce detectable GW sig- 
nals [13. 

A number of searches for gravitational radiation associ- 
ated with SGRs and magnetars have been conducted. In 
particular, a search for GWs associated with f-mode ring- 
downs in six magnetars targeting the frequency ranges 
1 - 3 kHz, 100 - 200 Hz, and 100 - 1000 Hz, with time 
windows of ^ 200 ms, found no evidence of a GW signal 
[19']. Other searches targeting short-duration 0.3 s) 
GWs from SGRs ^] as well as stacked GW emissions 
associated with the SGR 1900-1-14 storm event [2lj also 
found no evidence of GWs associated with SGR bursts at 
energies accessible by past interferometric gravitational 
wave detectors. 

Our work extends a search algorithm developed by 
Matone and Marka [13], targeting narrow-band, long- 
duration GW signals associated with SGR QPOs. This 
algorithm is based on an excess energy method and thus 
does not presume any particular waveform for the emit- 
ted GW signal. In addition, the algorithm also does not 
require exact knowledge of the timing of the GW signal 
relative to the electromagnetic trigger; we choose con- 
servative O (100 s) long time windows encompassing the 
entire fiare event. The algorithm was previously applied 
to LIGO data recorded during the December 2004 SGR 
1806—20 hyperflare event to constrain the energetics of 
any associated GW signal, and set a lowest bound of 
7.67 X 10*^ erg on the total GW emissions associated with 
the QPO at 92.5 Hz [23*1. A similar algorithm target- 
ing long-duration GW transients using a complementary 
cross-power technique is discussed in [2J1 , and is also rel- 
evant to searches for GWs from SGRs in the LIGO data. 

The LIGO network of interferometric gravitational 
wave detectors has completed 6 science runs to date. Sci- 
ence runs 5 and 6 lasted from November 4th, 2005 to 
September 30th, 2007, and from July 7th, 2009 to Oc- 
tober 20th, 2010, respectively. During S5 alone ^ 150 
EM flares originating from SGR 1806—20 were observed. 
If SGR flares are indeed coupled to seismic vibrations 
of a neutron star as suggested by the magnetar model, 
one might expect that each burst should excite the same, 
characteristic frequencies of the star's crust regardless of 
the luminosity of the fiare, leading to repeated GW emis- 
sions at the same frequencies. As a result, a comprehen- 
sive algorithm analyzing all triggers and QPO frequencies 
overlapping with the LIGO data set for evidence of re- 
peated GW emissions from SGR fiares might prove itself 
more sensitive than an algorithm targeting a single burst 
or QPO frequency. Although we illustrate the concept 



through a 10 Hz frequency band centered on the 92.5 Hz 
QPO observed during the December 2004 SGR 1806-20 
hyperfiare, the algorithm can be applied to any frequency 
window of interest, allowing the method to target each 
of the observed QPO frequencies. 

The algorithm concept presented has been designed 
to efliciently process large, multi-detector data sets con- 
taining multiple SGR flares from a single source, and 
includes a number of data quality measures intended to 
account for nonstationarity and deviations from Gaus- 
sianity in the data stream of a physical detector operating 
for months or years. The method, like the methods men- 
tioned above, benefits from knowledge of the arrival time 
of an electromagnetic counterpart to the hypothesized 
GW signal, as well as knowledge of the sky position of the 
source. In addition, we further assume that the recurrent 
SGR bursts contain the same QPO structures and seis- 
mic modes observed in hyperflares, and introduce a new 
detection criterion based on the two-sample Kolmogorov- 
Smirnov (K-S) test and Fisher's method, allowing us to 
increase the sensitivity of the algorithm relative to an- 
alyzing a single event. We also discuss the problem of 
variable detector orientation for each trigger, and dis- 
cuss how to optimize the concept accordingly. We show 
that with the proper data conditioning procedures the 
behaviour of real data can be modeled well mathemat- 
ically and approximated numerically through gaussian 
statistics. 

The paper is organized as follows: in Sec. |ll] we dis- 
cuss the algorithm and analytically derive the PDF of 
the background distribution for Gaussian white noise. In 
Sec. mil we discuss the implementation of the algorithm 
as a concept pipeline and data quality measures. Sec. IIVI 
verifies the implementation by applying the pipeline to 
randomly generated Gaussian noise and comparing the 
results to the predictions of the mathematical model. Fi- 
nally, in Sec. |V] we present a concept study of the off- 
source detector background and orientation using play- 
ground data segments from LIGO's fifth science run |25| . 



II. MATHEMATICAL MODEL OF THE 
ALGORITHM 

The time domain algorithm splits the data stream 
by bandpass filtering into three channels of a specific 
bandwidth motivated by X-ray observation constraints 
A/: the QPO channel, which is centered on the QPO 
frequency, and the up and down channels, centered at 
/up = /qpo + A/ and /down = /qpo - A/, respectively. 
The bandwidth A/ for example can be derived from the 
measured full width at half maximum (FWHM) of the 
QPOs observed in the EM (X-ray) emissions ,35]. We 
define the energy of a strain time series ^(i) in an inter- 
val At to be 

E = [ dt\^{t)\\ (2.1) 

J At 
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We can compare the energy contained in the QPO band 
to the energy contained in the adjacent bands during an 
interval At by computing the excess energy 



QPO 
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where w is a locaUy determined averaging factor [36|. 
Defining the excess energy in this manner ahows us 
to isolate the contribution of a GW signal associated 
with the QPO to the total energy of the data stream 
while simultaneously suppressing broadband instrumen- 
tal noise. It is also necessary to account for temporal 
non-stationarity in the data stream since we perform a 
comprehensive analysis with data spanning several years, 
during which changes in the properties of the detector 
noise are too significant to be ignored. We accomplish 
this by computing the excess energy in the time inter- 
val (tccntcr , ^center + At) , the cxccss energy in the time 
interval (tccntcr — At , tcenter), and then computing the 
difference in excess energy 
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resulting in a quantity which can be compared directly 
across different times. For on-source measurements we 
choose tcenter to be the time of an observed trigger [37{ . 
and for off-source measurements tccntcr is randomly sam- 
pled from stretches of data disjoint from the region hy- 
pothesized to contain a GW signal. The value of At is 
chosen based on the timescale of the SGR flares, and 
is the same for both on-source and off-source measure- 
ments: since we do not, in general, have detailed in- 
formation regarding the length of every SGR burst, we 
choose At to be the full duration of the longest observed 
lightcurve tail from a given SGR as an illustration here. 
For SGR 1806-20 this is At = 300 s based on the De- 
cember 2004 hyperflare. This choice of At is intended to 
be conservative to ensure that we capture associated GW 
signals. The method for calculating /\E°^^°^^ is depicted 
schematically in Fig. 

By considering each sampling point in the data stream 
as an independent, identically distributed normal random 
variable we can construct a theoretical model for the al- 
gorithm. In the following section we derive an explicit 
formula for the probability density function of ^E^^"^"^^^. 
These results are then used in Sec. llVl to check the algo- 
rithm's performance by comparing numerical simulations 
to the theoretical results. 



A. Sampling Distribution of AE 



excess 



the discrete analogue of (|2.1 
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where Eg is the sampling frequency and n ~ FgAt is 
the total number of sampling points. E follows a y^- 
distribution with probability density function (PDF) |26| 
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Assuming that the channels are uncorrelated and the sig- 
nal energy is distributed isotropically in frequency space 
it suffices to choose w = 1/2 in (|2.2p . and we can then 
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where each of the terms has probability density function 
([23)) . The PDF of A£"=''™"" can be found using an iden- 
tity for characteristic functions (here denoted by ip) [131 
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The characteristic function associated with (|2.5p is 
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so, using (|2.7p and the inverse Fourier transform we find 
that 
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(2.9) 
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The cumulative distribution function (CDF) is 

f^ABoxoo^. (x) = (2.10) 



— — f du 
27r J 

— oo — oo 



COS {su) 



■i/2 



l + frs' 



Note that the CDF can similarly be calculated for any 
other values of w. 



Here we model the strain noise of the detector by band- 
limited Gaussian white noise for which each sampling 
point, ^i, is an independent [ssj Gaussian random vari- 
able with zero mean and standard deviation a. The en- 
ergy of the strain noise in a time interval At is given by 



III. DESCRIPTION OF THE PIPELINE 

The pipeline is designed to sequentially process the 
LIGO data stream in the time domain and take AE^^'^"^^ 
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measurements around times associated with SGR flares. 
We distinguish between flare measurements and back- 
ground measurements. Flare measurements are per- 
formed at the time of an observed trigger, whereas back- 
ground measurements are performed at times before or 
after a trigger and used to characterize the detector back- 
ground. Fig. [T] depicts a schematic of the algorithm. 
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FIG. 1: A block diagram illustrating the steps in the analy- 
sis pipeline concept. The data conditioning step is discussed 
in Sec. IIII Al Data quality measures are intended to account 
for variability in the quality of data recorded by a physical 
instrument and are discussed in Sec. IIII Bl The injection step 
is optional, and is used to place an upper limit in the event 
that the pipeline fails to make a detection. 



A. Data Conditioning 

The ingoing data stream is conditioned by applying a 
series of three zero-phase, bandpass, Butterworth filters, 
resulting in the upper, lower, and QPO channels. Suf- 
ficiently large buffer time intervals are included and cut 
off after filtering at both ends of the data stream to make 
sure that no transients remain in the data due to filter- 
ing. The difference in excess energy is then calculated 
according to (12. 3p . Injections are performed by gener- 
ating the plus and cross components {t) and (t), 
respectively) of the simulated GW signal with a given 
root-sum-square strain (/iiss) 




hi (t) 



hl{t)) 



(3.1) 



and adding it to the input data prior to the data condi- 
tioning step. We can write 



h (t) ^F+-h+ (t) + Fx • hy. (t) 



(3.2) 



where and and -Fx are antenna factors determined 
by the orientation of the detector relative to the source 
at the time of a given trigger and the choice of the 
GW polarization frame. The antenna factors satisfy 
< , Fx < 1 and encode the directional sensitivity 
of the detector to an incoming plus- (F+) or cross- (Fx) 
polarized GW The injection process is performed 

once per trigger, resulting in a simulated foreground dis- 
tribution that can be used to determine the sensitivity of 
the algorithm. 

The root-sum-square strain measured at the detec- 
tor (hj-ss) can be translated into a total physical energy 
(^Gw) ^^^^ source would emit in GWs if the emission 
was isotropic f23| : 



Using a list of the start and end times of detector lock 
stretches that contain one or more astrophysical triggers, 
the pipeline sequentially processes each trigger, perform- 
ing a single measurement at the time of the trigger, and a 
fixed number of background measurements before and af- 
ter the trigger. For each measurement the data is filtered 
to isolate the QPO band and the two adjacent frequency 
bands. The algorithm then applies data quality measures 
to the filtered data streams as described in Sec. IIII Bl and 
computes ^E°^'^°^'^ as defined by (|2.3I) . After iterating 
over the full list of triggers the resulting background and 
flare distributions are compared using the two-sample K- 
S test [28], and detection is claimed if the distributions 
are distinguishable with 5a confidence. Otherwise, we 
inject a simulated waveform of known energy into a sin- 
gle pre-fiare background measurement corresponding to 
each trigger to determine the sensitivity — defined to be 
the minimum injected energy such that the background 
and foreground distributions meet our detection criteria 
— and set an upper limit on the energy of any GW signal 
contained in the data stream. 
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(3.3) 



where d is the distance between the detector and the 
source, and /o is the central frequency of the GW signal 
[39| . In the event of a non-detection this relation allows 
us to use the measured sensitivity to constrain the ener- 
getics of GW emissions at the source. 



B. Data Quality 

A number of measures have been implemented to ac- 
count for instrumental transients and the variability in 
data quality across long stretches of time. We rely pri- 
marily on three techniques: (i) the application of data 
quality flags, (ii) transient glitch rejection, and (iii) the 
use of a locally-determined weighting factor w to measure 
the excess energy (defined by (|2.2p V 

1. Data quality fiags are constructed by the LSC's 
detector characterization working group to charac- 
terize noise contamination in the detector's data 
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stream, and mark the times of known environmen- 
tal noise sources and detector calibration issues. 
The flags are divided into categories 1, 2, 3, and 
4 based on severity and duration. Here we reject 
any data marked with a flag designated category 1 
— times when the detector was not taking data in 
the design configuration — or category 2 — times 
when well-understood instrumental glitches were 
observed. We also avoid standard and significant 
line features while selecting the frequency bands, 
and we propose to apply lock-in filtering techniques 
to mitigate the effects of any potentially unavoid- 
able line features. Detailed descriptions of the flags 
can be found in [s^ . 

2. Transient glitch rejection provides a second method 
for reducing noise contamination. Since we expect 
a long-duration signal lasting hundreds of seconds 
we can safely reject transient glitches over much 
smaller time scales. This is accomplished using 
a method similar to the glitch veto algorithm dis- 
cussed in [2^: we divide the data stream of each, 
filtered channel into tiles of length At, typically 
of order Is or less, compute distributions of the 
energy per tile for each of the three channels, and 
recursively veto any tiles with energy more than ±4 
standard deviations from the mean tile energy. We 
then divide each channel by a duty factor 



DF = 1 



length (veto) 
length (data) 



(3.4) 



to renormalize the remaining data in this example 
[40| . The veto is applied to the strain time series of 
each filtered channel independently since glitches 
are not generally distributed evenly in frequency 
space. This process is iterated until either all out- 
liers have been removed or the fraction of data ve- 
toed passes a cutoff value, typically chosen to be 
10%, at which point the data segment is rejected 
as unreliable. 

3. Another source of data contamination arises from 
the curvature of the LIGO noise floor. Naively 
choosing w ~ 1/2 in the definition of the excess 
energy statistic (|2.2p results in a quantity that 
does not properly account for differences in the 
background noise across multiple frequency bands. 
Since the properties of the detector noise change 
with time we are forced to calculate a n ap propri- 
ate value of w for each measurement [4l|. With 
this in mind we define a local weighting factor w 
by averaging the four quantities 
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calculated by summing the energies of each channel 
in the time interval [t^ — At, + At] for each of 
the four times = ±tw,i,i-tw.2- This is depicted 
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FIG. 2: A schematic illustrating the regions of the time-series 
used to perform a measurement with a locally de- 

termined weighting factor w. The measurement is centered 
about a time tccntcr which is either the time of an observed 
SGR trigger or randomly sampled from the background data. 
The value of w is obtained by calculating the ratio (|3.5p us- 
ing the data contained in the interval [t^n — At, t^ + At] for 
each of the four times t^^ — it^^i, ±tw^2- These four values 
are averaged to estimate w, and then jg computed 

using this value in 1)2. 2|) and (|2.3p . The part of the diagram 
corresponding to /\^<=''<:<=ss ig divided into a solid region and 
a dashed region to emphasize that it is the difference in ex- 
cess energy between these two regions which we ultimately 
calculate using the locally determined w. 

We choose At to be the same time interval used 
for computing excess energy measurements, and 
iuj.i and ^^,.2 to be the minimum adjacent times 
which ensure that none of the measurements over- 
lap. With this scheme we can use the data immedi- 
ately preceding and following the region used for a 
given ^E'^^'^^^^ measurement to determine the ap- 
propriate value of w in (j2.2l) . Defined in this way, 
w accounts for differences between channels due to 
the shape of the noise fioor, as well as changes in 
the noise floor over time. 



IV. GAUSSIAN NOISE SIMULATIONS 

In order to validate the concept and test the mathe- 
matical model of section |lT] we simulate the result of ap- 
plying the pipeline to a set of detected SGR 1806—20 
flares coinciding with the data from LIGO's fifth sci- 
ence run (S5). This is accomplished by configuring the 
pipeline to perform a realistic study but replacing the 
LIGO data stream with stretches of randomly generated 
Gaussian white noise. This is not unreasonable as the 
data quality cut studies indicated that the " cleaned" data 
is very similar to a Gaussian random set. 



A. Simulated Detector Background 

We construct a background sampling distribution by 
calculating ^E°^'^°^^ for simulated data consisting of 240 
seconds of randomly generated Gaussian white noise with 
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strain spectral noise density n = IQ-^^Hz-i/^. The 
value of fi is chosen so that the simulated noise is simi- 
lar in magnitude to the LIGO noise floor during S5. We 
choose /qpo = 92.5 Hz and A/ = 10 Hz, corresponding 
to the frequency and FWHM of the strongest QPO signal 
observed in the lightcurve tail of the 27 December, 2004 
SGR 1806-20 hyperflare event, and At = 30s This 
value of At is shorter than the ~ 300 s duration of the 
lightcurve tail observed during the SGR 1806—20 hyper- 
flare, nevertheless for the purposes of a concept study, 
is less computationally expensive while providing essen- 
tially the same insight. We also compute the theoretical 
PDF of the background distribution using the same pa- 
rameters to compare the theoretical result (|2.9|) to the 
simulation. 

In anticipation of later work utilizing LIGO data, we 
also construct a second background distribution incor- 
porating a simulated veto step to demonstrate that our 
algorithm is robust with respect to the inclusion of data 
quality vetoes. This is accomplished by randomly remov- 
ing 5% of the sampling points from the strain time series 
of each, filtered channel independently, and then apply- 
ing a duty factor (|3.4p to each channel. We also obtain 
a fitted PDF by using nonlinear least squares regression 
to fit the theoretical CDF ((2l0)) to the empirical CDF of 
the resulting background distribution. Fig. [3] depicts the 
results of these simulations. 
In order to characterize the goodness of fit we compute 
the mean percentage error (MPE) 



MPE = 
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FIG. 3: Detector background with and without a simulated 
data quality veto step. The parameters used to generate the 
Gaussian noise are n = 491520 and h = 10~^^ Hz"^''^, where 
n = FsAt is the total number of sampling points used to make 
a measurement and n is the strain spectral density 

of the noise. The upper plot shows the agreement between the 
detector background with no vetoes (histogram) and the theo- 
retical PDF (solid curve) obtained by numerically integrating 
(|2.9p . The lower plot shows a fit of the theoretical PDF to 
the modified detector background after incorporating simu- 
lated data quality vetoes. The parameters of the fit, obtained 
using nonlinear least-squares regression to fit the theoretical 
CDF (|2.10p to the empirical CDF of the resulting detector 
background, are n = 493285 and n = 10"^^ Hz-^/^ 



where {xi} is the set of observations which make up the 
background sampling distribution, n is the number of ele- 
ments of {xi}, F (x) is the empirical CDF associated with 
{xi}, and F (x) is the theoretical (no veto) or fit (veto) 
CDF obtained from (|2.10p . For the no veto simulation 
we find 



MPEn 



0.0125 



(4.2) 



indicating that the theoretical PDF/CDF obtained by 
numerically integrating ()2.9|) / (|2.10|) with the same pa- 
rameters used to generate the simulated Gaussian noise 
accurately describes the simulated detector background. 
The MPE for the veto simulation is 



MPE„ 



0.0196 



(4.3) 



indicating that the fit of the theoretical model to the 
simulated detector background with a veto step is also 
accurate. In addition, we observe that the fit pa- 
rameters (nvoto,"-voto) = (493285, 10~22Hz"^/^) for the 
veto case are within 0.4% of the theoretical values 
(n„o veto, "no veto) = (491520, 10-22 Hz-1/2) for the no 
veto case, demonstrating that the influence of the veto 
step on the detector background distribution is small. 



B. Sensitivity 

The sensitivity is determined by injecting wave- 
forms with given h^-ss into the background described in 
Sec. lIV Al Though in general we would expect a GW sig- 
nal from a magnetar to be elliptically polarized p!6| . for 
simplicity's sake we have used sine-Gaussian waveforms 



hit) = ylsin(27r/ct-f 0)e"(^)' 



(4.4) 



linearly polarized along the plus-direction, and parame- 
terized by the quality factor Q = v^ttt/c > 1000, where 
A is the peak amplitude, fc is the central frequency, r is 
the 1/e decay time, </> is the phase of the sinusoid rela- 
tive to the gaussian envelope, and to is the time of the 
waveform peak. We choose fc = 92.5 Hz, corresponding 
to the QPO at 92.5 Hz, and choose values of to and Q 
such that all but a negligible fraction of the injected h-^ss 
is contained in the interval [tccntor , tcenter + At] . Plus- 
polarized injections with a given value of h^ss are con- 
structed by first generating a waveform h (t) with func- 
tional form (|4.4I) and A = 1, calculating the energy of 
this waveform, and then rescaling so that the total en- 
ergy is given by F^h'^^^. The injection waveform satisfies 
< Ehi^t) < 



hrss and Eh^t) 



/i^g in the idealized case 
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F+ = l. 

A foreground sampling distribution is constructed by 
making one such injection into each of 108 simulated, 
Gaussian noise segments, where the value 108 is obtained 
by counting the number of electromagnetic SGR 1806—20 
triggers observed during S5 for which enough science- 
quality Hl-Ll coincident data is available to compute 
^£;oxcoss ^-^^^ Ai = 30s nil. The simulated background 
and foreground distributions are then compared using 
the two-sample K-S test. This procedure is performed 
100 times for each choice of h^ss to estimate the percent- 
age of simulation trials in which we can claim detection 
with 5ct significance [11] . We iterate by interpolating the 
resulting injected hrss versus detection percentage curve 
to find the sensitivity, defined to be the minimum in- 
jected hiss such that we can claim detection 50% of the 
time. In Fig. |4] we examine the sensitivity for a Gaus- 
sian noise simulation of the LIGO S5 data set containing 
108 triggers. For simplicity we ignore the detector ori- 
entation and set = Fx = 1 for all triggers. As the 
average value for and Fx is usually around 0.6, this 
simplification underestimates the sensitivity by an aver- 
age factor of ^ 1.67. 



Assuming a detector-source distance of 14kpc for SGR 
1806—20 ^] this corresponds to an isotropic GW emis- 
sion at the source p.3p of 

Fg^ = 1.10 X 10^6 (4 g) 

V. APPLICATION TO OFF-SOURCE 85 DATA 

In Sec. HV] we demonstrated that (12. 9p accurately 
describes the background modeled through simulated, 
Gaussian data. Here we perform a similar analysis using 
the data from LIGO's fifth science run (S5) for the 4-km 
Hanford detector (HI) [2^. We first analyze the effects 
of incorporating the data quality measures described in 
Sec, nil B] to account for deviations from Gaussianity due 
to noise contamination in the LIGO data stream. Then 
we calculate the background AF°'^'^°'''' distribution of HI 
during S5. Finally, we discuss how to extend the al- 
gorithm to perform a multi-detector study utilizing the 
full LIGO network, and optimize the method for SGR 
1806—20 triggers observed by the Hl-Ll network during 
S5. We again choose /qpo = 92.5 Hz, At = 30 s, and 
A/ = 10 Hz, based on observations of the SGR 1806-20 
hyperflare event [23| . 




2.0x10^^^ 3.0x10"^^ 4.0x10"^^ 5.0x10"^" 6.0x10 

Injected Strain (h ) 



FIG. 4: A simulation of the sensitivity for Gaussian white 
noise with strain spectral noise density h = 10~^^ Hz~^''^ . 
The sensitivity is defined to be the minimum injected hiss such 
that the foreground and background distributions are distin- 
guishable via the two-sample K-S test with 5a significance 
50% of the time. We consider a background sampling distri- 
bution with 20,000 samples and a trigger distribution with 
108 samples, and use linearly-polarized sine-Gaussian injec- 
tions. Each injection is performed 100 times assuming ideal 
detector-source orientation to calculate the detection percent- 
age. 

We find that the sensitivity with negligible uncertainty 
for the simulated data set, with trigger times correspond- 
ing to those astrophysically detected by satellites, is 

/i^™'' = 4.15 X 10^22 Hz-i/2_ (-4 5) 



A. S5 Data Quality 

In order to characterize the weighting factor w we 
calculate the energy in each of the three channels — 
Fqpo, Flip, and Fdown — in the time interval [tcontor - 
At, icenter + At] and compute the sampling statistic 



-^up ~t- Fdown 

In figure 5 this measurement is performed for a series of 
times {tccntcr} distributed throughout the S5 run up to 
April 2007. 

We observe that, on average, w < 0.5, consistent with 
the shape of the S5 LIGO noise floor |25J. In addition, 
the large fluctuations in w from sample to sample indi- 
cate the need to determine the weighting factor in the 
excess energy (|2.2p locally for each measurement. In the 
remainder of the paper all AE°^'^°^^ measurements are 
performed using the scheme for determining w described 
in Sec. iHTBl 

We investigate the influence of the data quality mea- 
sures on the shape of the HI background AF'^'^'^^^*' distri- 
bution by performing three trials, corresponding to three 
veto methodologies. In the first no data quality measures 
are applied. In the second we exclude data marked with 
category 1 or category 2 flags. Finally, in the third we 
incorporate data quality flags as well as a transient glitch 
rejection step using 10 ms tiles and a 4cr veto threshold. 
The resulting distributions are summarized in Fig. [51 
We conclude that although the inclusion of DQ flags 
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oretical PDF (|2.9p if we partition the S5 run into two, 
independent sub-runs: the first lasts from the beginning 
of S5 in November 2005 to June 2006, and the second 
lasts from June 2006 to the end of S5 in October 2007. 
The division in June 2006 corresponds to a commission- 
ing break during which the HI detector received a num- 
ber of upgrades. As a result, the detector configurations 
in each of the sub-runs are considerably different, and, in 
the context of our analysis, should be treated as distinctly 
different detector systems. Fig. [7] shows the background 
distributions obtained for each of the two sub-runs. 



Oct05 Jan06 Api-06 Jul06 Oct06 JanOV AprOV 
Time 



FIG. 5: w, defined by (|5.ip . calculated using HI data for a 
series of background times distributed throughout the S5 run 
up to April 2007. The running average is computed at time t 
by averaging all measurements occurring prior to t. 
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FIG. 6: Veto methodology and resulting HI detector back- 
ground using S5 data. In the top figure no data quality mea- 
sures are applied. In the middle figure data marked with 
category 1 or category 2 DQ flags is removed. In the bottom 
figure category 1 and 2 flags are applied as well as a transient 
glitch rejection step (see Sec lIIIBjl using 10 ms tiles and a 4(t 
veto threshold. 



alone goes a long way towards successfully removing 
the loudest outliers, the glitch veto step is also nec- 
essary to fully account for deviations from Gaussian- 
ity and to establish strong agreement between the dis- 
tribution derived from real data and the mathematical 
model/simulations based on gaussian statistics. 



B. HI Detector Background 

We find, after incorporating all of the data quality 
procedures described in Sec. IHIBl that the resulting 
^^cxcoss (distribution is accurately described by the the- 
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FIG. 7: HI detector background (histogram) for each of the 
sub-runs November 2005 - June 2006 and June 2006 - Octo- 
ber 2007 using category 1 and category 2 DQ flags as well as 
a glitch rejection step with 10 ms tiles and a 4cr veto threshold 
(see Sec. lIII B| . For the production of the lower plot, only data 
before April 2007 was used. The solid curves are flts of the 
theoretical PDF (12. 9p . The flt parameters are obtained using 
nonlinear least squares regression to fit the theoretical CDF 
(|2.10p to the empirical CDF of the background distribution. 

Given the agreement between the resulting background 
distributions and the theoretical model (|2.9p . we conclude 
that our data quality methodology is sufficient to account 
for non-Gaussianity arising from noise contamination in 
the S5 data. 



C. Extension to Multi-Detector Networks 

Extending the concept to encompass of the full LIGO 
network requires us to consider how to pool information 
recorded by multiple instruments into a single analysis, 
as well as the orientation of each detector at the times 
of the observed triggers. We extend to a multi-detector 
network by first performing the single-detector analysis 
described in Sec. lHII for each, independent detector. This 
gives a set of p-values representing the significance with 
which the two-sample K-S test can distinguish between 
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the background and foreground distributions in the data 
stream of each detector. Assuming a network of N in- 
dependent detectors, Fisher's method computes the test 
statistic 

N 

X^ = -2^\np, (5.2) 

i=l 

which Jbhows a x^-distribution with 2N degrees of free- 
dom [33| . We claim detection for the detector network if 
is sufficiently large that we can reject the null hypoth- 
esis with 5(T significance. We choose to explore Fisher's 
method over a coherent method for two main reasons: 
(i) Fisher's method does not require us to assume any 
relationship between the waveforms observed in different 
detectors beyond the existence of excess energy in the 
QPO band, and (ii) a significant fraction of triggers will 
only be recorded by one detector because of downtime, 
and coherent methods are not applicable for these trig- 
gers. 

Likewise, we can use Fisher's method to define the sen- 
sitivity for the detector network. We obtain a set of 
antenna factors for a given detector by calculating the 
antenna response function at the time of each trigger 
in the Dominant Polarization Frame (DPF) of that de- 
tector [i^l- For a single detector, this is the frame in 
which the detector response is entirely in the plus direc- 
tion {Fx = 0) [29j]. We then get a multi-detector sensi- 
tivity result by first fixing a value of /irss, and then per- 
forming the single-detector sensitivity analysis described 
in Sec. lIVBl for each detector in the network individually, 
weighting injections with the appropriate set of antenna 
factors. We then combine the single-detector results us- 
ing Fisher's method, and define the sensitivity of the net- 
work to be the minimum injected ft-rss such that we can 
reject the null hypothesis with 5a significance in 50% of 
trials. In the remainder of the paper we will consider the 
Hl-Ll network, where LI is the 4-km Livingston detec- 
tor. 

In addition, we must consider the fact that the de- 
tector network is not necessarily equally sensitive to all 
triggers observed from a single source since the relative 
positions of the detectors and source change with time. 
Fig. shows the quantity F+ for each trigger and for 
HI and LI separately. Since the algorithm becomes more 
sensitive as the number of triggers increases, but is less 
sensitive if the detector is poorly aligned for some of the 
triggers, we expect that there is an optimal subset of the 
full list of triggers for which the sensitivity of a multi 
trigger approach is maximized. We perform this opti- 
mization by repeating the sensitivity analysis using sim- 
ulated, Gaussian noise described in Sec. lIVBj for HI and 
LI individually, weighting the injection waveforms with 
the appropriate antenna factors, and then combining the 
results using Fisher's method to get a sensitivity for the 
hypothetical Hl-Ll network. We then find the worst- 
aligned member (smallest value of of the combined 
list of HI and LI triggers, exclude this trigger for both 



detectors, and compute the sensitivity again. In Fig. |81d 
we plot the sensitivity as a function of the number of 
triggers, obtained by iterating this process. 
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FIG. 8: The upper plot shows the -F+ values for HI and LI 
in their respective Dominant Polarization Frames (DPF-s, see 
[2^) calculated for the direction of the SGR 1806—20 source 
at the times of the astrophysical triggers observed during S5. 
In the lower plot we compute the sensitivity for Gaussian noise 
with strain spectral noise density h = 10~^^ }iz~^^^ as a func- 
tion of the number of triggers included in the analysis with 
the highest corresponding F+ values. In each case we choose 
the number of these triggers such as to maximize sensitivity 
(i.e. minimize h^ss^). 

We conclude that the sensitivity of the algorithm to GW 
emissions from SGR 1806—20 is maximized for the S5 
Hl-Ll data set if we restrict only to the 78 triggers for 
which the detectors were best aligned with the source. 
The same procedure can easily be repeated to tune the 
algorithm for other data sets or sources. Furthermore, 
after repeating the simulation of Sec. IIVBI with the de- 
tector orientation taken into account we find that the 
single detector sensitivities are 

(/i^™«)jj^ = 6.68 X 10-22 Hz-1/2 (5 3) 

(/i^™^)l^ = 6.08 X 10^22 Hz-1/2 (5 4) 

and the sensitivity for the Hl-Ll network obtained using 
Fisher's method is 

(/^?r)Hi-Li = 5.42 X 10-22 jj,-i/2. (5.5) 



10 



Again assuming a detector-source distance of 14 kpc for 
SGR 1806—20, the corresponding isotropic GW emission 
energies are 

{E'Gw)m = 2.84 X 1046 erg (5.6) 
(^^gW)li = 2.35 X 10^6 erg (5.7) 
(^gW)hi-li = 1-87 X 1046 erg (5.8) 

VI. CONCLUSION 

We have investigated aspects of a muhi-trigger data 
analysis concept targeting the detection of long-duration 
narrow-band gravitational wave transients. Potential as- 
trophysical motivators include quasi-periodic oscillations 
of magnetars that are already observed in the electro- 
magnetic spectrum following soft gamma repeater hype- 
flare events. This concept significantly extends a single- 
trigger method introduced by Matone and Marka 
both in data handling sophistication and comprehen- 
sive use of available external information. The method 
is based on an excess-energy statistic targeting long- 
duration, narrow-band signals, and incorporates a num- 
ber of data quality measures to account for nonstation- 
arity in the data stream of a physical detector over long 
periods of time. The detection threshold and sensitivity 
are both defined in terms of the two-sample Kolmogorov- 
Smirnov test as a viable example. 

Analytical results for the PDF and CDF of the detec- 
tor background distribution enabled us to demonstrate 
that these expressions accurately describe the output of 
the pipeline applied to (i) Gaussian white noise and (ii) 
off-source LIGO S5 data from the 4-km Hanford detector 
after reasonable data quality cuts. In turn, they showed 
that the justified data quality cuts are efficient in remov- 
ing the major contamination from the data. It is im- 
portant to note that a useful side product of the method 
might be the ability to identiiy distinct running condition 
of the gravitational wave detectors (for a similar work in 
this topic, see [34]). 

We also demonstrated how to combine the data 
streams of multiple, independent detectors into a single 
study on an alternative non-coherent way, and illustrated 
how to optimize the algorithm for a future search involv- 
ing the S5 Hanford-Livingston data set. Simulating the 
optimal trigger list and using Gaussian white noise with 



strain spectral noise density n = 10~^^ llz~^^^ resulted in 
an upper bound on the total energy emitted in isotropic 
gravitational radiation by SGR 1806—20 in the 30 sec- 
onds following a flare of 

(^gW)hi-li = 1-87 X lO^^erg x (^)' , (6.1) 

where d is the detector-source distance. 

Due to the uncertainties of currently available model 
predictions, throughout our study we assumed that (i) 
the central frequency, bandwidth, and time scale of a 
gravitational wave signal associated with an SGR QPO 
event can directly be taken from the electromagnetic ob- 
servations of the QPO signal, (ii) the duration of the 
gravitational wave signals are the same for every QPO 
event of the same SGR source, and (iii) the duration 
of all the gravitational wave signals associated with the 
astrophysical triggers from an SGR source can be ap- 
proximated with the duration of the longest flare event 
among the triggers (e.g. in case of SGR 1806—20, it is the 
27 December 2004 hyperflare event). These assumptions 
however might turn out to be too optimistic as theoret- 
ical model predictions become more sophisticated in the 
future. Our method can be applied to any chosen cen- 
tral frequency, and can be generalized to a wide scale of 
bandwidths. However, a more detailed study is necessary 
to see the consequences of having different durations for 
the gravitational wave signals emitted during the differ- 
ent flare events of the same SGR source. We leave this 
study to be part of a future work. 
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